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Identifying network routers and paths 

_ [30020466-2] 

Technical Field 

[0001] This invention relates to methods and apparatus for identifying routers, the 
5 associated interconnecting communications links and the paths taken by packets traversing 
those links in a communications network, such as a packet network using the Internet 
Protocol (IP). The invention is particularly applicable to networks that use link-state routing 
protocols such as Open Shortest Path First (OSPF) or Intermediate system-Intermediate 
system (IS-IS), or any equivalent thereof 

10 

Background Art 

[0002] In order to distinguish themselves from their competitors and improve levels of 
service to customers without compromising existing cost structures or capital budgets, 
Internet service providers (ISPs) are increasingly employing cost optimisation, service 

15 enhancement or service differentiation mechanisms to implement "traffic management" 
within their networks. These mechanisms include traffic engineering (described below), 
quality of service (QoS) measurements and service level agreements (SLAs). There are a 
variety of technologies that can help operators implement these "traffic-managed" networks. 
In the case of IP networks these include Multi-Protocol Label Switching (MPLS), see for 

20 example Request for Comments (RFC) 3031 of the Internet Engineering Task Force, and 
Differentiated Services, see for example RFCs 2474 and 2475. 

[0003] A common theme among these technologies is their use of packet classification at 
the ingress point where a data packet first enters a discrete network (generally referred to in 
Internet tenninology as an autonomous system). Conversely, the same packet will often be 

25 declassified at the egress point of that network so that the next network/autonomous system 
to receive the packet can, if it wishes, reclassify the packet in its own way. The classification 
ensures each packet receives the appropriate treatment when routed through a network. The 
treatment that a packet will receive as it passes through the network will differ depending 
upon the type of classification given to the packet at the ingress router. 

30 [0004] For each classified packet, the intermediate routers coerce routing of the packet onto 
a different logical path through the network away from the predetermined default path that 
the packet would normally take if it were unclassified. At least one default path is defined 
for each source and destination within the network. This default path is typically the least- 
cost path as defined by the Interior Gateway Protocol (IGP) cost metric for each 

35 interconnection (described hereinafter in the context of OSPF with reference to Figures 4-7). 
[0005] A logical path is therefore an alternative non-default path taken by any packet that 
receives different routing (packet forwarding) treatment A logical path may for example be 
a separate physical path from the one that would typically be taken by the packet if it were 
unclassified. Similarly a logical path may be defined by different queuing treatment at the 
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intermediate routers. In either example, a classified packet will receive a different set of 
treatments, depending upon the classification received, giving the packet a different set of 
transmission characteristics as compared to the same packet were it routed on the default 
path. Each logical path has a set of assigned properties that determine the transmission 
5 characteristics for the packets that traverse the path, such as how much bandwidth on the 
physical interconnection is reserved for that logical path, the level of service ("bronze", 
"silver" or "gold"), the maximum permissible jitter, or any specific routers through which the 
logical path must pass. 

[0006] For example, a network operator applying traffic engineering may decide to transmit 

10 videoconference traffic that is sensitive to jitter via a dedicated logical path through its 
MPLS-enabled network. That path is different from other default paths which non- 
videoconference traffic will take. Despite having potentially more router hops the dedicated 
path (in this case a separate physical path) carries no other traffic and can therefore easily 
accommodate the combined voice and video load without introducing unwanted jitter. All 

15 other traffic takes the default path, e.g. the route with the smallest overall cost metric as 
defined by the IGP. Whichever route is taken, all traffic eventually arrives at the egress 
router and the packets are then declassified ready to be passed to the next network. Without 
this load balancing all network packets would be routed using the default path and at peak 
times this may cause the network to become overloaded and discard or delay packets, making 

20 the videoconference unusable and causing problems for other data traffic users. 

[0007] The traffic-engineering process can be applied at many different levels, for example 
for different customers, for different services or for combinations of both, Equally, other 
traffic-management tools such as QoS and SLA mechanisms that have different business 
objectives could be employed. Both QoS and SLAs require packet classification at the 

25 ingress and egress points and both result in other routing policies and the use of logical paths 
that are different from the default (usually the least-cost) path to route traffic concurrently 
within the network. 

[0008] Many traffic-engineering techniques involve monitoring of the network's operation, 
for example to audit conformance to agreed QoS or SLA criteria and to trigger timely 

30 remedial action or (less desirably) compensation if the criteria are not attained. A problem 
for network management systems attempting such monitoring of traffic-managed networks is 
to discover where packets enter and leave the network and whether the classification and 
subsequent treatment of the packet is correct. The network management system should 
detect incorrect packet classification which could cause traffic to be routed incorrectly, or 

35 failure of an internal router which could cause all traffic to follow the same path irrespective 
of classification, in either case resulting in packets being delayed or discarded and perhaps 
breaching an SLA. 

[0009] The overall Internet is divided into many administrative domains. For example, an 
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Internet service provider might constitute a single administrative domain. Each 
administrative domain forms part of the Internet by entering into agreements with 
neighbouring domains (other ISPs etc.) to form peering or transit relationships to carry each 
other's traffic and enable the connectivity expected by users. An administrative domain 
5 contains one or more autonomous systems (ASs). An AS is a set of routers typically under a 
single technical administration (e.g. an ISP), which: 

- appears externally to have a single coherent interior routing plan (using one and possibly 
several interior gateway protocols and one or more common metrics to route packets 
within the AS); 

10 - presents a consistent picture of what destinations are reachable through it; and 

- uses an exterior gateway protocol to route packets to other ASs. 

Hereinafter the word "network" is used in the context of the Internet to mean such an 
autonomous system. In the context of other kinds of communications system the word 
network should be understood as meaning an ensemble of operational elements which is 
75 analogous in concept and functionality to an Internet AS, whether the ensemble comprises 
the whole of the system or only part thereof. 

[0010] The Internet consists of many ASs in many administrative domains. At each 
connection between each AS there are "edge" routers and each edge router has the potential 
to implement some form of traffic management. A large ISP may have many ingress and 
20 egress routers interacting with many other ISPs and have many different end customers. 
Each ingress and egress router could be classifying and routing traffic using many different 
policies. The enormous challenges involved in deploying, monitoring and managing traffic- 
management technologies is readily apparent. 

[001 1] Having knowledge of the overall topology of the network (e.g. the identity of active 
25 edge routers and of intermediate routers which handle a packet traversing the network) is of 
considerable assistance in meeting these challenges. However, existing packet network 
technologies do not provide this knowledge in an explicit form that is easily accessible to 
external tools that could be used to facilitate traffic management. A system supplementary to 
the network itself that could assist in the challenges described would require topological 
30 information to be delivered from a potentially very large network in near real time and ideally 
without discernable impact upon existing network and router performance. Routers typically 
contain a complete database of router and link status in the network. This information is 
known as the link state database and is used to generate a routing table within each router to 
determine the optimum neighbouring router to which to forward a data packet towards its 
35 ultimate destination. The routing table is generated for example by means of the OSPF link- 
state protocol described in RFC 2328 (and referred to hereinafter as the OSPF protocol). The 
information contained within the router's link-state database describes the topology to an 
extent sufficient for that router's operational requirements; such data could in principle be 
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extracted piecemeal from the routers and be exploited to produce a complete topology 
description. Unfortunately, using current technologies the required data it is not accessible in 
a manner that satisfies the necessary requirements of scale, accuracy and timeliness whilst 
ensuring that network integrity is maintained. 
5 [0012] For example, although queries using the Simple Network Management Protocol 
(SNMP) could in theory be used to gather the required information, this approach is not well 
suited to use with large networks containing multiple routers. The large set of SNMP queries 
required when determining a complete topology for a network would place a large processing 
burden upon the routers and generate a substantial volume of network traffic. Furthermore, to 

10 avoid having to query each address on a network, most of which will be terminals such as 
personal computers or workstations, the router addresses would need to be known in 
advance, imposing a substantial administrative burden and compromising the benefits of 
autonomous discovery or verification of the network topology. Similar problems apply to 
extracting information from an operation support system (OSS) or other external data source; 

15 this information may not be available, may not be provided, or may be deemed too sensitive 
to permit retrieval via direct access. Furthermore, even if the information were available 
there is no guarantee, without independent verification, that the information is accurate. 
[0013] It is an object of this invention to facilitate the monitoring of traffic management, by 
assisting with the provision of descriptions of network topology. For example, a network 

20 topology description obtained by using the invention can assist network operators to 
administer networks deploying traffic management techniques such as MPLS and 
Differentiated Services, or can be used in deploying core MPLS-enabled IP networks (see 
RFC 2917), Voice over IP services (also known as Internet Telephony), SLAs and QoS 
mechanisms. In particular the invention facilitates monitoring of the different logical paths 

25 and any associated transmission characteristics implemented over the various physical 
interconnections, routers and sub-networks present in the network. 

Disclosure of Invention 

[0014] According to one aspect of this invention there is provided a method for identifying 
30 a network-wide set of paths potentially taken by packets in a communications network, 
comprising the steps of: 

collecting packets containing information indicative of the interconnection of the 
network, and of its interconnection with other networks; 
detecting the contents of the collected packets; 
35 using the detected contents to identify the network-wide set of routers and sub- 

networks and their interconnections, that are traversed by communications within the 
network; and 

providing an output indicative of any selected part of the network-wide set of routers 
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and sub-networks and their interconnections. 

[0015] "Network-wide" in this context means that the network description produced is not 
focused on any particular router or other node in the network. OSPF for example notionally 
produces in each router a tree description of paths through the network, with that router as the 
5 root of the tree, as a transient step towards generating a desired routing table. Paths between 
routers that are not needed to forward packets from this "root" router are not included in the 
tree. In contrast, the present invention generates a description of the network topology in 
which all routers are equally significant, and in a typical implementation provides a 
comprehensive view of all paths, not just the default path, between all routers. 

10 [0016] According to another aspect of this invention there is provided apparatus for 
identifying a network-wide set of paths potentially taken by packets in a communications 
network, comprising: 

a collector for collecting packets containing information indicative of the 
interconnection of the network, and of its interconnection with other networks; 

15 a detector for detecting the contents of the selected packets; 

an identifier for using the detected contents to identify the network-wide set of routers 
and sub-networks and their interconnections, that are traversed by communications within the 
network; and 

an output for providing an indication of any selected part of the network-wide set of 
20 routers and sub-networks and their interconnections. 

Brief Description of Drawings 

[0017] A method and apparatus in accordance with this invention, for identifying 
functionality of routers interconnected by communications links in a communications 
network, will now be described, by way of example, with reference to the accompanying 
drawings, in which: 

Figure 1 shows a notional fragment of the Internet; 
Figure 2 shows an illustrative network topology description; 
Figures 3 to 7 show the format of link state advertisements as defined in the OSPF 
protocol; 

Figures 8 to 13 show a procedure for deriving a network topology description; and 
Figure 14 shows the notional fragment of Figure 1 after failure to two routers 
within it. 

35 Detailed Description 

[0018] Referring to Figure 1, a notional fragment of the Internet is shown comprising an 
autonomous system AS1 and portions of two other autonomous systems AS2 and AS3 
connected to it. The system AS 1 contains two edge routers 1 0 and 12 which provide external 



25 



30 



WO 2004/012393 PCT/GB 2003/003308 



connections, to the systems AS2 and AS3 respectively, and three internal routers 14, 16 and 
1 8 which are connected solely to other routers within their own AS. The systems AS2 and 
AS3 likewise include edge routers 20 and 30 respectively, providing connection to the system 
AS1, as well as internal routers 22, 24, 30 and 34. 
5 [001 9] Each AS requires forwarding information, both local to the AS and global between 
ASs, so that data packets can be routed through the nodes or routers to the correct 
destinations. Between ASs the routers (and routes) are configured either statically or 
dynamically using a class of protocols called Exterior Gateway Protocols, e.g. the Border 
Gateway Protocol (BGP) described in RFC 1771. Within an AS the routers (and routes) are 
1 0 configured either statically or dynamically using a class of protocols called Interior Gateway 
Protocols (IGPs), such as OSPF, IS-IS or Routing Information Protocol (RIP). For 
convenience the following description will assume the use of OSPF, but the invention can be 
used in association with other protocols embodying analogous concepts and functionality to 
OSPF, including IS-IS. 

15 [0020] In a link-state routing protocol such as OSPF each router is responsible for 
distributing and maintaining a database describing the topology of an area or zone forming 
the whole or part of the AS containing that router. This database is known as the link-state 
database. On start up, the router is only aware of its own local state, its connected interfaces 
and networks in accordance with infomiation that is pre-configured by the router's 

20 administrator. The process of learning and distributing further network state information, 
such as connectivity, is achieved by exchanging special data packets defined by the OSPF 
protocol with other routers within the AS. 

[0021] Initially "adjacencies" are formed with neighbouring routers using, for example, 
packet multicast techniques. An adjacency is a relationship formed with each of a router's 

25 active neighbours for the purpose of exchanging routing information. Once an adjacency has 
been formed the adjacent routers exchange information about their state using OSPF link- 
state description packets formatted in accordance with the protocol. This process continues 
until both routers share a common view of the topology of their zone of the AS, thereby 
building a link-state database in each router. 

30 [0022] On completion of the adjacency forming process throughout the AS, each router in 
the AS executes the same algorithm in conjunction with its own copy of the link-state 
database, to construct a unique routing table comprising a tree of least-cost paths, as defined 
by the IGP metric, from itself as root to each destination. The resultant least cost paths 
become the default routes taken by all unclassified packets traversing the network. 

35 [0023] As noted above, sets of networks within the AS can be grouped together into routing 
areas or zones. The topology of a zone is not shared with the rest of the AS containing that 
zone, to provide a significant reduction in routing traffic. Between zones summary packets 
are exchanged to ensure inter-zone connectivity. 
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[0024] After the initial generation of its link-state database and routing table, each router 
repeats the information exchange and route calculation process if a change in its network 
zone occurs. A change might involve the addition or removal of a link or router, or a change 
in a link's costs. To avoid the possibility of the link-state database becoming stale the 
5 packets are, in the absence of new updates, re-broadcast periodically, normally every thirty 
minutes. 

[0025] The invention implements non-intrusive discovery of the network topology within 
an AS using a link-state IGP such as OSPF or IS-IS, and creation of an annotated 
representation of that topology to facilitate the subsequent discovery of a network-wide set of 

10 paths through that network. The annotated representation describes the AS by means of a 
directed graph, in which vertices represent routers or networks and edges represent links 
connected to the routers. The annotations indicate discovered data about the router or 
network represented by each vertex. In the case of routers the annotations indicate associated 
ff address, a set of interfaces denoted by IP address, and type or function (intra-zone, inter- 

15 zone or inter-autonomous system). For networks the associated network addresses and 
netmask, denoted by IP address, and network type (stub, transit or external) are shown. 
Transit networks are those capable of carrying data traffic that is neither locally originated 
nor locally destined. Stub networks are analogous to cul-de-sacs and external networks are 
destinations to other networks outside the AS. 

20 [0026] A visual representation of an example of a graph produced in accordance with the 
invention is shown in Figure 2. The edges of the graph connect the individual vertices. An 
edge connects two routers when they are attached via a physical point-to-point link whilst an 
edge connecting a router to a network indicates that the router has an interface on the 
network. Each edge is annotated with the cost of using that interface for packet forwarding, 

25 as defined by the IGP. In OSPF this is known as the link metric. 

[0027] One aim of the invention is to generate a topology with limited impact upon the 
normal operation of the network or the routers. The topology discovery process is non- 
intrusive in the sense that the required information is obtained by means of limited active 
interaction with the routers or other network elements and without generating significant 

30 additional network traffic. To this end and as shown in Figure 1 at least one probe or monitor 
40 is connected to the AS at a point where the OSPF packets are present. The probe could 
for example be a low-cost computer, such as a "personal computer'', running a dedicated 
software program and connected to the AS via an Ethernet card. The "logical" point of 
connection to the network is chosen to ensure that OSPF packets broadcast by the routers can 

35 be collected. Physically, this connection point may be, for example, a port on a router, or a 
tap into a link between two routers, from a sub-network via a hub or switch. In OSPF terms a 
connection is required at any point in the network traversed by OSPF packets. For these 
physical connections the software program in the probe 40 opens a connection in 
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"promiscuous mode" onto the network link or segment of the chosen network zone. 
Promiscuous mode allows the probe to receive the required OSPF packets irrespective of 
their LAN destination address. The received packets are allowed to continue their journey 
through the network without interference (rather than being received and removed from the 
5 network). Alternatively, OSPF and other IGP routing packets are also stored by the routers 
themselves in their own LSDBs and are available in raw byte format via the SNMP MIB for 
debugging purposes. Initially, and at each subsequent change in topology, the packets can 
also be collected from the OSPF MIB with limited network impact and overhead. Collection 
of the raw packets from a MIB requires a very low number of SNMP calls, rather than the 
10 multiple SNMP calls that are a feature of the existing methods of using SNMP. Changes to 
the MIB data can be tracked using SNMP traps. A SNMP trap, once set, will inform an 
external application of a change in the target MIB data. 

[0028] The probe 40 does not implement a state machine as described in RFC 2328 to 
establish an adjacency with any router, as that would require the probe to become an active 

15 participant in the OSPF routing protocol, thereby creating spurious link-state database entries 
in that zone's other routers. For collection of the packets by monitoring a link the probe 40 
remains passive and relies on the flooding process of OSPF packets by the routers in the zone 
or AS. The probe 40 waits for OSPF packets to arrive on the monitored interface, rather than 
requesting them using the normal OSPF mechanisms. A topology derivation procedure 

20 (described below and illustrated in Figures 8 to 13) is executed upon the receipt of every 
OSPF packet, to build up the desired topology description incrementally. The start-up 
procedure requires the default link-state refresh interval, normally thirty minutes, to have 
elapsed before a complete topology description is determined. Becoming an active 
participant in the OSPF protocol can accelerate the collection of the packets and reduce the 

25 time for the complete discovery process; obviously a side effect in this case is the increased 
impact upon the network and the routers. Alternatively, where the probe 40 collects OSPF 
routing data from raw packets contained within the MIB, a complete and up-to-date topology 
can be obtained with no start-up period required. This method also limits the impact upon 
the routers or network. Thereafter by continuing to track the OSPF packets the probe can 

30 keep the topology description in step with the state of the network. 

[0029] The number of probes required for an AS depends upon the size of the AS and how 
it is organised. A single probe can generate a complete annotated topology for the zone to 
which it is connected. An OSPF network always has at least one zone, which is known as the 
backbone. Connection to this backbone is preferred. Experience indicates that many 

35 networks are hierarchical in design and a single probe connected to the backbone will provide 
a very useful annotated topology. To discover a complete annotated topology for a multi- 
zone AS, a connection to each active zone is required. However, even a single connection 
will provide, in addition to the complete annotated topology of the chosen zone, summary 
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information for the networks in other zones in the AS, plus any connections to external 
networks via the AS's edge routers. 

[0030] Each probe 40 collects the packets and makes copies of selected types of packets 
described below. It then extracts data from these copies and processes the data to yield 
5 infonnation for the annotated topology. 

[00311 Fi ve types of packet are defined in the OSPF protocol, as shown in the following 
table. For the purposes of the present invention one of these OSPF packet types is used, the 
Link State Update packet, type 4. 
Type Description 
10 1 Hello 

2 Database Description 

3 Link State Request 

4 Link State Update 

5 Link State Acknowledgement 

15 [0032J Hello packets are also present on OSPF networks, for example on broadcast media 
such as Ethernets, and are transmitted most frequently, appearing at regular intervals on a 
given network segment. Hello packets can therefore be used to supply the probe 40 with an 
accurate indication of network time. An accurate time stamp is applied to Hello packets on 
their arrival at the probe. For example, a probe based upon a personal computer could obtain 

20 an accurate indication of time either from its internal clock or from a Global Positioning 
System (GPS) receiver in conjunction with the Network Time Protocol (NTP). Experience 
has shown that most network operators provide an accurate time service that can be used for 
this purpose. By measuring the inter-interval time of the Hello packets and storing the result 
an accurate internal representation of the passage of time driven by the normal OSPF packet 

25 sequence can be established. This form of timer mechanism is a convenient way of providing 
a timebase for ensuring obsolete information is purged from the probe 40. However, any 
other form of timer mechanism that can provide an accurate indication of passage of time 
will suffice. 

[0033] The Link State Update packets (OSPF type 4) contain one or more Link State 
30 Advertisements (LSAs), which describe the state of either a router (including the state of the 
router's interfaces and adjacencies) or a network. The collection of LSAs for a zone 
comprises the link-state database. Several types of LSA exist, as shown below, and each 
LS A type describes a different element within the AS or network zone. 
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LS Type Description 

1 Router-LSAs 

2 Network-LSAs 

3 Summary-LSAs (EP network) 
5 4 Summary-LSAs (ASBR) 

5 AS-external-LSAs 
[0034] LS As are broadcast whenever a change in the network configuration occurs, and at 
regular intervals to ensure that stale information is not present in the network. Each LS A has 
a header portion (shown in Figure 3) that contains both a key (comprising a combination of 

10 fields in the header) and age information that give a unique identity to the LSA within the 
AS. The process of determining if an LSA should be accepted into the link-state database is 
described inRFC2328, sections 13.1 and 13.2, and is used by the probe 40 to determine if an 
LSA it receives is newer than an existing LSA that it already has, and whether that LSA 
should be accepted into its own link-state database. 

75 [0035] As the probe's internal clock is updated the new time value is used to increment the 
age field of every LSA in the link-state database. If an LS A's age value thus becomes greater 
than the OSPF standard architectural constant MaxAge, conventionally set to one hour, the 
LSA is removed from the link state database (as shown in Figure 8). This process provides a 
safeguard ensuring that stale LSAs are removed from the probe's link-state database, so that 

20 if an updated LSA is missed by the probe or lost due to a temporary link failure, the topology 
description provided by the probe 40 will not be unduly corrupted. 
[0036] When the probe has first assembled its link-state database, and after subsequent 
changes in the probe's link-state database are detected to have occurred (e.g. following 
receipt of a new or updated LSA), the probe's annotated description of the current network 

25 topology must be created or refreshed. The procedure for accomplishing this will now be 
described, with reference to Figures 8 to 13. The precise sequence of most of the steps 
involved is not critical, although step 6 must be performed last. Equally, the topology could 
be entirely re-calculated for every link-state database change, or just incrementally in respect 
of the most recent LSA changes processed. Both approaches are equally valid and the 

30 method that proves simpler to implement or more appropriate in a specific implementation 
can be chosen. Li the example described below identifying the vertices of the topology first 
is convenient and conforms to normal graph construction techniques. 
[0037] Step 1 (Figure 9): Identify the active sub-networks within the zone and the active 
routers in those sub-networks; this is accomplished using the LSAs that contain information 

35 about the network elements within the current zone, specifically Type 2 Network-LSAs and a 
subset of Type 1 Router-LSAs. Network-LSAs specify the routers that are attached to a sub- 
network that supports more than one router. The Network Mask field in aNetwork-LSA (see 
Figure 5) describes the size, or range, of the address space of the sub-network, and the IP 
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address in the Link State Identifier field of the LSA's header (Figure 3) identifies the first IP 
address in the sub-network. Lists of active routers on that sub-network are also provided, the 
routers being denoted by IP address in the Attached Router field (Figure 5). Each LSA 
contains one entry for each and every active router on the sub-network. 
5 [0038] Router-LS As can be sub-divided depending upon the type of link being described, 
and each Router-LS A may describe several links of different types. The types of connections 
are identified as follows: 



Type Description 

1 Point-to-point connection to another router 

10 2 Connection to a transit network 

3 Connection to a stub network 

4 Virtual link 



Only those Router-LSAs containing information on type 3 links to a stub network are 
considered in this step. For each Router-LS A describing connections to stub networks, each 

75 Link Identifier field (Figure 4) and the following Link Data field give the IP address and 
network mask for a connection to a stub network on the router identified by the Advertising 
Router field of the LSA's header. If the penultimate router on a sub-network fails so that the 
sub-network no longer has two or more routers the corresponding Network-LS A may not be 
actively withdrawn from the link-state database. In this situation although the Network-LS A 

20 is still present, it is superseded by a new type 3 Router-LS A containing an entry describing a 
connection to a stub network. Therefore, in order to ensure only active routers on active sub- 
networks are considered in this step, the information contained in these two types of LSAs 
are combined so that a router defined by an entry in a type 3 Router-LS A takes precedence 
over information about the same router defined in a Network LSA. 

25 [0039] Step 2 (Figure 9): Specify the topology's internal network vertices. A vertex is 
created for each active network in the list of active sub-networks derived in step 1. The 
vertex is annotated with the IP address and the network mask thus specifying the identity and 
address range of the sub-network represented by the vertex. These vertices are also annotated 
with the type 'internal network'. 

30 [0040] Step 3 (Figure 9): Specify the topology's router vertices and their associated 
interfaces: 

Step 3.1 : The type 2 Router-LSAs containing entries describing the connections to 
transit networks are analysed. These LSAs describe routers that have connections to sub- 
networks that have more than one entry/exit point. For each LSA the IP address of the router, 
35 identified by the Advertising Router field in the LSA's header, is added to the list of vertices. 
A list of active router interfaces identified by the IP address in the Link Data field is 
associated with the vertex entry. In this context an interface on a router is a synonym for the 
port to which a network connection or link is made. 
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[004 1 ] Step 3 .2 : The type 1 Router-LS As containing entries describing links that are point- 
to-point connections are analysed. As before the router DP address is added to the list of 
vertices and the IP addresses of the router's interfaces identified by the Link Data field are 
also added. 

5 [0042] Step 3.3: A similar process is employed for the type 4 Router-LSAs containing 
entries describing virtual links (virtual links are described in RFC 2328, sections 3.1 and 1 5). 
[0043] Step 3 .4 : Next the type 3 Router-LSAs, containing information about connections to 
stub networks, are analysed. The process is the same as that for type 2 Router-LSAs 
containing entries describing the connections to transit networks. However, in this case the 

10 router address itself is added as the associated router interface. The Link Data field for this 
type of Router-LSA entry does not describe the router's interface, but describes the network 
mask of the connected stub network. The IP address interface for die router's interface cannot 
therefore be determined. For the purposes of specifying the connections, as described later, 
the start point for this type of link is considered to be the router itself. 

15 [0044] Step 3.5 (Figure 10): The specified router vertices are annotated with their 
associated types. The router types are marked according to the E and B flags in the VEB 
field of the Router-LSA (Figure 4). If the B flag is set then the router is marked as inter-area; 
if the E flag is set the router is marked as inter- AS or inter-network; otherwise the vertex is 
marked as intra- area. If the V flag is set the vertex is the end-point for one or more virtual 

20 links. 

[0045] Step 3.6 (Figure 1 1): For each ASExternal-LSA a router vertex is added, if it does 
not already exist, as identified by the Advertising Router field of the LSA header, with an 
associated interface as identified by the TP address in the 'Forwarding Address' field (Figure 
7). The vertex is annotated as an 'inter- AS router'. Similarly, for each Summary-LSA a 

25 check is made that a vertex exists for the router identified by the IP address in the Advertising 
Router field, and that it is annotated as being an 'intra-area router'. This step has two 
purposes: to check the integrity of the data and to speed the discovery process on probe start- 
up, during the period where a complete topology has not yet been collected. 
[0046] Step 4.1 (Figure 1 2) : Specify the topology's inter-area network vertices. For this the 

30 type 3 and type 4 Summary-LS As are considered. These LS As describe connections to inter- 
area destinations comprising either networks (for type 3 Summary-LS As) or inter-area routers 
(for type 4 Summary-LSAs). For each Summary-LSA, of either type, a network vertex 
identified by the IP address Advertising Router field and the Network Mask field (Figure 6) 
is added to the list of vertices. These vertices are annotated with the type 'summary 

35 network'. 

[0047] Step 4.2 (Figure 1 2): Specify the topology's inter-AS network vertices. The type 5 
ASExternal-LSAs are used to specify a set of external vertices that represent routes to 
networks external to the network containing the probe 40. These are routes whose existence 
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has been made known via either pre-configured static route descriptions or via an Exterior 
Gateway Protocol such as BGP-4. For each of these external routes the OSPF routers will 
issue an ASExtemal-LS A. For each LSA a vertex is added to the vertex list for the network 
identified by the IP address in the Link State Identifier field of the LSA's header and the 
5 Network Mask (Figure 7). The vertex is annotated with the type 'external network'. 
[0048] Step 5 (Figures 12 and 13): Specify the edges in the network: 

Step 5.1: Specify the transit edges. Type 2 Router-LSAs containing entries 
describing connections to transit networks are used to specify edges in the graph that 
interconnect vertices representing routers to any vertices representing networks that offer a 

1 0 through or transit service. (A transit network is one that has two or more separate entry/exit 
points.) For each Router-LSA containing an entry that describes a transit connection to a 
network, an edge is specified in the evolving topology description from the router interface 
defined in the Link Identifier field (Figure 4) to the sub-network defined by the Link Data 
field. (According to RFC 2328 "when connecting to an object that also originates an LSA 

15 (i.e., another router or a transit network) the Link Identifier is equal to the neighbouring 
LSA's Link State Identifier".) Therefore the sub-network with the corresponding Network- 
LSA 'Link State Identifier' is used to determine the endpoint for the edge being specified. 
The edge is annotated with the cost of traversing the link as defined in the Metric/Cost field. 
It is important to note that there could be more than two edges connected to the sub-network 

20 vertex. 

[0049] Step 5.2: Specify the stub edges. Router-LSAs containing entries that describe 
connections to stub networks are used to specify edges between the relevant router vertices 
and network vertices with only one entry and exit point. For each type 3 Router-LSA 
containing an entry that describes a connection to a stub network, an edge is added starting at 

25 the router's interface; in this instance the interface has the same address as the router itself 
(in effect addressing the router directly) and ending at a sub-network. The edge start is 
denoted by the Adverting Router field and the destination is the sub-network as defined by 
the Link Identifier and Link Data fields. The sub-network address is denoted by the IP 
address in the Link Identifier field and the network mask by the Link Data. The edge is 

30 annotated with the cost of traversing the link defined in the Metric/Cost field. 

[0050] Step 5.3 : Specify the point-to-point edges. Router-LSAs describing point-to-point 
and virtual links are used to specify edges that directly interconnect router vertices. Virtual 
links are described in RFC 2328 sections 3.1 and 15 and for the purposes of generating a 
topology they can be handled in the same way as point-to-point links. For each Router-LSA 

35 containing entries that describe either point-to-point connections to another router, type 1 , or 
virtual links, type 4, an edge is added to the evolving topology. The edge starts at the router 
interface denoted by the IP address in the Link Data field and the destination router denoted 
by the EP address in the Link Identifier field. The edge is annotated with the cost of 
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traversing the link defined in the Metric/Cost field 

[0051] Step 5.4: Specify the inter-area edges. Summary-LSAs are used to specify edges 
connecting router vertices to vertices describing any inter-area destinations. There are two 
types, type 3 which describe destinations that are IP networks and type 4 which describe 
5 destinations that are other inter-area routers. For each type 3 Summary-LSA an edge is added 
from the router's interface (in this instance having the same address as the router itself, in 
effect addressing the router directly) to the inter-area sub-network as defined by the Link 
State Identifier field and the Network Mask field. For type 4 Summary-LSAs the Network 
Mask field is not meaningful and must be zero, and the Link State Identifier is the DP address 
JO of the inter- AS router. In both cases the edge is annotated with the cost of traversing the link 
as defined in the Metric/Cost field (Figure 6). 

[0052] Step 5.5: Specify the inter- AS edges. ASExternal-LSAs are used to specify edges 
connecting router vertices to vertices describing any external destinations outside the AS. 
For each ASExtemal-LSA an edge is added from, the routet interface denoted by the 

15 Forwarding Address field (Figure 7) to the external network defined by the Link State 
Identifier field in the LSA's header and the Network Mask field (Figure 7). The edge is 
annotated with the cost of traversing the link defined by the E bit field and the Metric/Cost 
field. If the E-bit is unset then the metric, or cost, is defined in the same units as the other 
internal link metrics of the other edges. If the E-bit is set then the cost of the link is 

20 considered larger than any other internal link state path. 

[0053] Step 6 (Figure 1 3) : Maintain a graph of viable paths. The probe 40 must eliminate 
any out-of-date information, thus ensuring that only viable network paths are reported to a 
traffic-management or other application using the topology information. For example, there 
is a possibility that the probe's link state database may contain LSAs that arrived prior to a 

25 network outage or failure that caused a partition in the network. Fresh LS A updates from any 
router that resides on the network on the far side of the partition failure point will not have 
been able to reach the probe 40 where they would be used to remove the stale information. 
To maintain an accurate topology description the probe 40 must eliminate the vertices and 
edges representing affected routers, networks and links. 

30 [0054] For example, in the scenario of Figure 1 the routers 1 6 and 1 8 might crash owing to 
a power failure. Figure 14 shows the resultant network configuration. The probe 40 will 
continue to receive updates from the routers 10 and 14 on its side of the failure or 'network 
partition'. However the router 12 lying beyond the partition cannot communicate the change 
to either of the routers 1 0 and 14. Consequently the link-state databases in the routers 1 0 and 

35 14 and the probe 40 will continue to contain LSAs sent from router 12. However the 
information in these LSAs can no longer be considered reliable as it is from outside the 
probe* s current known routing zone. The purpose of the probe 40 is to create a description of 
all possible and viable network paths, so this description should not include portions of the 
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network beyond the point where the failed routers 16 and 18 are situated. 
[0055] The reach- ability of each vertex in the graph is assessed by systematically inspecting 
all the vertices using a recursive procedure starting at the vertex representing the point where 
the probe 40 is connected to the network. There are a number of well-known procedures for 
5 determining reach-ability in graphs based upon, for example, 'breadth first search' and 'depth 
first search' algorithms. Only the connected edges to each vertex are considered in these 
algorithms so the disconnected portion of the graph, in this example routers 16 and IS, will 
be eliminated and the result is a 'connected graph' eliminating paths from router 12 to router 
30 and beyond from the results. 

10 [0056] It is important to note that the start vertex for the recursive algorithm could either 
be a router or a network depending upon which element is logically closest to the probe 40. 
For example, if the probe is connected directly via a tap on a point-to-point connection then 
the start point is the logically nearest router. Or, if the probe is connected on a transit sub- 
network then the sub-network should be used. If the probe is connected to the MIB on a 

15 router then the router should be used. A transit sub-network provides multiple reachable 
connections to the overall network via each of its connected routers, and these multiple 
connections must be considered during any reach-ability calculations. The resultant graph 
can be exported to an external application (e.g. for traffic management) and includes only the 
active interconnected vertices currently known to the probe 40. 

20 [0057] The graph can be exported by the probe 40 to one or more external applications via 
an appropriate form of inter-process communication. For example, the known Remote 
Procedure Call mechanism (RPC) or the mechanisms described in standards for Common 
Object Request Broker Architecture (CORBA) or Java Remote Method Invocations (RMI) 
may be used. The software for the probe 40 could also be embedded directly into application 

25 software to create a simple, small, lightweight, portable system that could be transported 
around the network by the operator as required. 

[0058] The annotated graph data could be made available so that an application is made 
aware of each change as it occurs. This is sometimes referred to as a 'publish and subscribe' 
mechanism whereby the application subscribes to the changes as they are published. 
30 Alternatively, and more simply, a new topology could be delivered to an application on 
demand. 

[0059] The exported topology information could take several forms but will include some 
type of listing of the active vertices and active edges. The listing of active vertices typically 
includes: the vertex identity denoted by the IP address and network mask or prefix length; the 
35 type of network element represented by the vertex, for example, network, inter- AS, inter- 
zone and intra-zone, and the intra-zone and the zone to which they belong. Also included 
could be the list of interfaces, denoted by IP address. The list of edges typically includes the 
vertices to which they are connected, denoted by IP address and network prefix and the cost 
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or metric of using the link represented by the edge. Also included in the list of edges could be 
the interface used on the router, also denoted by IP address. 

[0060] If an application requests changes as they occur, using a 'publish and subscribe 
mechanism, then edges and vertices that become inactive or a network change can be 
5 removed from the topology description by specifying the vertices in terms of IP address and 
network prefix. Similarly edges to be removed can be identified in terms of the two 
connecting vertices. When informing the application that an edge or vertex is no longer 
active the annotation information, such as edge metric or vertex type, can also be supplied 
but is not strictly necessary. 

10 [0061] An application makes use of the annotated graph data to determine the set of current 
active logical paths by, for example, extracting a list of those vertices providing any inter-AS 
connectivity. Vertices of this type comprise the ingress and egress points of the network and 
are the most likely places where traffic classification would be applied as described above. 
The application can directly query the routers represented by those vertices to determine if in 

15 fact traffic classification is present; one common mechanism that could be used for such a 
query is SNMP (RFC 1157). SNMP and the associated Management Information Bases 
(MIBs) for the chosen traffic management systems are available on the majority of routers 
and provide a widely accepted mechanism for access to this type of network management 
data. The internal router vertices could also be searched if there is a likelihood of any 

20 internal traffic classification being present on the network. This is less likely but in some 
situations may occur. 

[0062] If traffic classification is being used, for example, to route the traffic from a given 
provider along a path other than the default least-cost path, via an MPLS Label Switched 
Path (LSP), the external application can request that the router return information about the 
25 actual non-shortest path currently in use. SNMP again can be used to retrieve this path 
information including any transmission characteristics, for example the reserved bandwidth, 
that have been assigned to the logical path. 

[0063] The discovered topology data can be used to determine the network-wide set of 
paths, including the set of default paths for the topology. It is important to note that multiple 

30 logical paths from different source routers may potentially traverse a single interconnection. 
As a consequence the network-wide set of paths must be considered when determining 
alternative logical paths. Failure to consider the network-wide set of paths may lead to over- 
specification and congestion on a router, sub-network or interconnection that services 
multiple logical paths from different source/ingress routers. The network-wide set of paths is 

35 required to ensure the validity of these calculations. To determine this set of paths use is 
made of recursive procedures for performing traversal of a graph based upon 'breadth-first 
search' or 'depth first search* and Dijkstra's algorithm (described in RFC 2328); these 
provide the set of network- wide paths, including the shortest paths, for each combination of 
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ingress/source and egress/destination router. The inputs to the algorithms are the IGP cost 
metrics and the discovered graph data about the routers, sub-networks and interconnections. 
[0064] The network operator can, for example, use the combined information, including the 
set of network-wide paths and their associated costs/metrics through the AS, in conjunction 
5 with the overlaid requested traffic management information (about the LSP) to monitor the 
logical non-default path deployment- This combined information provides a valuable aid to 
the network operator, for example in designing new paths, LSP provisioning, and ensuring 
that the network is performing to design specification. 

[0065] For example, by comparing the network-wide paths, the default paths, the active 

JO logical paths and the routing objectives associated with the active paths it is possible to 
generate a set of alternative logical paths that would conform to the routing objectives 
associated with the active logical paths. Referring to Figure 1 (and assuming for simplicity in 
this example that the IGP path cost is analogous to the number of routers traversed and that 
all links have an equal maximum capacity), the application will calculate that the default path 

15 from the router 10 to the router 12 is via the router 1 S. An active MPLS LSP is discovered 
between the routers 1 0 and 1 8 that requires a reserved bandwidth equating to 75% utilisation 
of the link maximum capacity. This LSP has been installed at the request of the manager of 
AS3 who requires a guaranteed level of bandwidth for connection to AS2. A second path is 
also discovered between the routers 1 0 and 1 2 via the routers 1 4 and 1 6, that is being used for 

20 another purpose; this path requires 20% of the bandwidth on those links. The application 
determines that the first LSP is on the default path, and that the combined load of the first and 
second LSPs does not equate to more than the available maximum capacity. The application 
can therefore recommend that an alternative path for the first LSP would be via the router 14 
and the router 16, rather than via the default path through the router 18. 

25 [0066] Utilising a mechanism to inform the external application of any change to the LSP, 
for example caused by loss of an internal transit network owing to link failure, may help the 
operator to mitigate the impacts of such a failure by providing an immediate warning of the 
LSP change. One such mechanism that allows routers to provide feedback is the SNMP trap 
mechanism. An SNMP trap, once set, will inform an external application of a change in the 

30 target MB data. The new LSP, or any changes to the characteristics of the LSP, can then be 
overlaid over the changed topology once again providing near-real time feedback of LSP 
behaviour. 

[0067] The annotated topology provided by the probe 40 is therefore able to assist operators 
in various network management tasks including those described above. The described 
35 process could also be applied, but is not limited to, other forms of traffic management and 
other technologies that employ routing over separate logical paths via packet classification at 
ingress and egress routers, as alternatives to typical least-cost path routing, such as 
Differentiated Services, Virtual Private Networks (VPNs), Voice over IP, SLAs and QoS 
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mechanisms. 



